Recent studies show that, despite being effective on numerous tasks, text processing algorithms may be vulnerable to deliberate attacks. However, the question of whether such weaknesses can directly lead to security threats is still under-explored. To bridge this gap, we conducted vulnerability tests on Text-to-SQL, a technique that builds natural language interfaces for databases. Empirically, we showed that the Text-to-SQL modules of two commercial black boxes (Baidu-UNIT and Codex-powered Ai2sql) can be manipulated to produce malicious code, potentially leading to data breaches and Denial of Service. This is the first demonstration of the danger of NLP models being exploited as attack vectors in the wild. Moreover, experiments involving four open-source frameworks verified that simple backdoor attacks can achieve a 100% success rate on Text-to-SQL systems with almost no prediction performance impact. By reporting these findings and suggesting practical defences, we call for immediate attention from the NLP community to the identification and remediation of software security issues.
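The analysis of overhead imagery using computer vision is a problem that has received considerable attention in the academic literature. Most techniques operating in this area are highly specialised and require expensive manual annotation of large datasets. These problems are addressed here by developing a more generic framework, incorporating advances in representation learning, that allows new categories of imagery to be analysed more flexibly with limited labeled data. First, a robust representation of an unlabeled aerial imagery dataset is created based on the momentum contrast mechanism. This representation is subsequently specialised for different tasks by building accurate classifiers from 5 labeled images. The successful low-level detection of urban infrastructure evolution from 60 million unlabeled images exemplifies the substantial potential of our approach to advance quantitative urban research.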
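Cross-lingual word embedding (CLWE) techniques play a fundamental role in tackling natural language processing challenges for low-resource languages. Their dominant approaches assume that the relationship between embeddings can be represented by a linear mapping, but the conditions under which this assumption holds have not been explored. This research gap has recently become critical, as it has been shown that relaxing the mapping to be non-linear can improve performance in some cases. We present, for the first time, a theoretical analysis that identifies the preservation of analogies encoded in monolingual word embeddings as a necessary and sufficient condition for the ground-truth CLWE mapping between those embeddings to be linear. On a novel cross-lingual analogy dataset covering five representative analogy categories for twelve distinct languages, we carry out experiments that provide direct empirical support for our theoretical claim. These results offer additional insight into the observations of other researchers and contribute to the development of more effective cross-lingual representation learning strategies.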
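Objective: To develop and validate an automated method for bedside monitoring of sleep state fluctuations in neonatal intensive care units. Methods: A deep learning-based algorithm was designed and trained using 53 EEG recordings from long-term (a)EEG monitoring of 30 near-term neonates. The results were validated using an external dataset of 30 polysomnography recordings. In addition to training and validating a single-EEG-channel quiet sleep detector, we constructed the Sleep State Trend (SST), a bedside-ready means of visualizing classifier outputs. Results: The accuracy of quiet sleep detection in the training data was 90%, and the accuracy was comparable (85-86%) across all bipolar derivations available from the 4-electrode recordings. The algorithm generalized well to the external dataset, showing 81% overall accuracy despite different signal derivations. SST allowed an intuitive, clear visualization of the classifier output. Conclusions: Fluctuations in sleep state can be detected with high fidelity from a single EEG channel, and the results can be visualized as a transparent and intuitive trend on bedside monitors. Significance: The Sleep State Trend (SST) may provide caregivers with a real-time view of sleep state fluctuations and their cyclicity.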
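Due to data protection laws and official procedures within institutions, sharing medical data between institutions is difficult in practice. As a result, most existing algorithms are trained on relatively small electroencephalogram (EEG) datasets, which is likely to be detrimental to prediction accuracy. In this work, we simulate a situation in which data cannot be shared by splitting a publicly available dataset into disjoint sets representing the data held by individual institutions. We propose to train a (local) detector in each institution and to aggregate their individual predictions into one final prediction. Four aggregation schemes are compared: the majority vote, the mean, the weighted mean, and the Dawid-Skene method. The approach was validated on an independent dataset using only a subset of the EEG channels. When each institution provides a sufficient amount of data, the ensemble reaches an accuracy comparable to that of a single detector trained on all the data. The weighted mean aggregation scheme showed the best performance; it was only marginally outperformed by the Dawid-Skene method when the local detectors approached the performance of a single detector trained on all available data.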
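To make three of the aggregation schemes named in this abstract concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes each local detector outputs a probability for the positive class; the function names, the 0.5 threshold, and the per-institution weights are hypothetical choices for illustration (the abstract does not say how the weights are derived). The Dawid-Skene method is omitted because it requires iterative estimation over many examples.

```python
from collections import Counter
from typing import Sequence


def majority_vote(labels: Sequence[int]) -> int:
    """Return the hard label predicted by the most local detectors.

    Ties are resolved in favour of the label encountered first.
    """
    return Counter(labels).most_common(1)[0][0]


def mean_aggregate(probs: Sequence[float], threshold: float = 0.5) -> int:
    """Average the detectors' probabilities, then threshold the average."""
    return int(sum(probs) / len(probs) >= threshold)


def weighted_mean_aggregate(
    probs: Sequence[float],
    weights: Sequence[float],
    threshold: float = 0.5,
) -> int:
    """Weighted average of probabilities, then threshold.

    The weights are an assumption here, e.g. a per-institution
    validation score; they must be non-negative and not all zero.
    """
    score = sum(p * w for p, w in zip(probs, weights)) / sum(weights)
    return int(score >= threshold)


# Three hypothetical local detectors scoring one EEG segment.
probs = [0.9, 0.4, 0.3]
labels = [int(p >= 0.5) for p in probs]

vote = majority_vote(labels)
mean = mean_aggregate(probs)
weighted = weighted_mean_aggregate(probs, weights=[1.0, 3.0, 3.0])
```

In this toy input the schemes disagree: one confident detector pulls the plain mean above the threshold, while the majority vote and the weighted mean (which here trusts the other two detectors more) do not follow it.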
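LiDAR ("Light Detection And Ranging" or "Laser Imaging, Detection, And Ranging") technology can be used to provide detailed three-dimensional elevation maps of urban and rural landscapes. To date, airborne LiDAR imaging has been predominantly confined to the environmental and archaeological domains. However, the geographically granular and open-source nature of this data also lends itself to societal, organizational and business applications in which geo-demographic type data is used. Specifically, the complexity of processing this multi-dimensional data has so far restricted its broader adoption. In this paper, we propose a series of convenient task-agnostic tile elevation embeddings to address this challenge, using recent advances in unsupervised deep learning. We test the potential of our embeddings by predicting seven indices of deprivation (2019) for small geographies in the Greater London area. These indices cover a range of socio-economic outcomes and serve as a proxy for the wide variety of downstream tasks to which the embeddings can be applied. We consider the suitability of this data not only on its own but also as an auxiliary data source in combination with demographic features, thus providing a realistic use case for the embeddings. Having trialled various model/embedding configurations, we find that our best-performing embeddings lead to root-mean-squared-error (RMSE) improvements of up to 21% over using standard demographic features alone. We also show that our embedding pipeline, which combines deep learning with K-means clustering, produces coherent tile segments that allow the latent embedding features to be interpreted.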
Research on automated essay scoring has become increasingly important because it serves as a method for evaluating students' written responses at scale. Scalable methods for scoring written responses are needed as students migrate to online learning environments, resulting in the need to evaluate large numbers of written-response assessments. The purpose of this study is to describe and evaluate three active learning methods that can be used to minimize the number of essays that must be scored by human raters while still providing the data needed to train a modern automated essay scoring system. The three active learning methods are the uncertainty-based, the topological-based, and the hybrid method. These three methods were used to select essays included as part of the Automated Student Assessment Prize competition, which were then classified using a scoring model trained with the Bidirectional Encoder Representations from Transformers (BERT) language model. All three active learning methods produced strong results, with the topological-based method producing the most efficient classification. Growth rate accuracy was also evaluated. The active learning methods produced different levels of efficiency under different sample size allocations but, overall, all three methods were highly efficient and produced classifications that were similar to one another.
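As a rough illustration of what an uncertainty-based selection step can look like, the sketch below ranks unscored essays by the entropy of the scoring model's predicted score distribution and sends the most uncertain ones to human raters. The abstract does not state which uncertainty measure the study uses, so the entropy criterion, the function names, and the example probabilities are all assumptions.

```python
import math
from typing import List, Sequence


def entropy(probs: Sequence[float]) -> float:
    """Shannon entropy (in nats) of a predicted score distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def select_most_uncertain(
    predictions: Sequence[Sequence[float]], k: int
) -> List[int]:
    """Return the indices of the k essays with the highest entropy.

    `predictions[i]` is the model's probability distribution over
    score categories for essay i (hypothetical input format).
    """
    ranked = sorted(
        range(len(predictions)),
        key=lambda i: entropy(predictions[i]),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical model outputs for three unscored essays, three score levels.
predictions = [
    [0.98, 0.01, 0.01],  # confident prediction
    [0.34, 0.33, 0.33],  # nearly uniform, so most uncertain
    [0.60, 0.30, 0.10],  # moderately uncertain
]
to_label = select_most_uncertain(predictions, k=2)
```

Under this criterion, essays 1 and 2 would be routed to human raters first, and the model would be retrained once their scores are available.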
This paper presents a novel framework for planning in unknown and occluded urban spaces. We specifically focus on turns and intersections where occlusions significantly impact navigability. Our approach uses an inpainting model to fill in a sparse, occluded, semantic lidar point cloud and plans dynamically feasible paths for a vehicle to traverse through the open and inpainted spaces. We demonstrate our approach using a car's lidar data with real-time occlusions, and show that by inpainting occluded areas, we can plan longer paths with more turn options than without inpainting. In addition, our approach follows paths derived from a planner with no occlusions (called the ground truth) more closely than other state-of-the-art approaches do.
Feature acquisition algorithms address the problem of acquiring informative features while balancing the costs of acquisition to improve the learning performance of ML models. Previous approaches have focused on calculating the expected utility values of features to determine the acquisition sequences. Other approaches formulated the problem as a Markov Decision Process (MDP) and applied reinforcement-learning-based algorithms. In comparison to previous approaches, we focus on 1) formulating the feature acquisition problem as an MDP and applying Monte Carlo Tree Search, 2) calculating the intermediary rewards for each acquisition step based on model improvements and acquisition costs, and 3) simultaneously optimizing model improvement and acquisition costs with multi-objective Monte Carlo Tree Search. With Proximal Policy Optimization and Deep Q-Network algorithms as benchmarks, we show the effectiveness of our proposed approach in an experimental study.
The celebrated proverb that "speech is silver, silence is golden" has a long multinational history and multiple specific meanings. In written texts, punctuation can in fact be considered one of its manifestations. Indeed, the virtue of speaking and writing effectively involves - often decisively - the capacity to place breaks properly. In the present study, based on a large corpus of world-famous and representative literary texts in seven major Western languages, it is shown that the distribution of intervals between consecutive punctuation marks in almost all texts can universally be characterised by only two parameters of the discrete Weibull distribution, which can be given an intuitive interpretation in terms of the so-called hazard function. The values of these two parameters tend to be language-specific, however, and even appear to navigate translations. The properties of the computed hazard functions indicate that, among the studied languages, English turns out to be the least constrained by the necessity to place a consecutive punctuation mark to partition a sequence of words. This may suggest that, compared to the other studied languages, English is more flexible, in the sense of allowing longer uninterrupted sequences of words. Spanish reveals a similar tendency, to only a slightly lesser extent.
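For readers unfamiliar with the distribution mentioned above, here is a small Python sketch of a two-parameter discrete Weibull distribution and its hazard function. It assumes the common parameterization with support k = 1, 2, ... and parameters p and beta; the abstract does not state which parameterization the study uses, and the function names and example values are illustrative only. Read this way, the hazard at k is the probability that a punctuation mark follows the k-th word, given that none followed the previous k - 1 words.

```python
def survival(k: int, p: float, beta: float) -> float:
    """P(X >= k): no punctuation mark after the first k - 1 words."""
    return (1.0 - p) ** ((k - 1) ** beta)


def pmf(k: int, p: float, beta: float) -> float:
    """P(X = k) for k = 1, 2, ... under the assumed parameterization."""
    return survival(k, p, beta) - survival(k + 1, p, beta)


def hazard(k: int, p: float, beta: float) -> float:
    """P(X = k | X >= k), in closed form."""
    return 1.0 - (1.0 - p) ** (k ** beta - (k - 1) ** beta)


# With beta = 1 the distribution reduces to the geometric case, and the
# hazard is constant and equal to p.
flat = [hazard(k, p=0.2, beta=1.0) for k in range(1, 6)]

# With beta > 1 the hazard grows with k: the longer the run of words,
# the more likely the next word is followed by a punctuation mark.
rising = [hazard(k, p=0.2, beta=1.5) for k in range(1, 6)]
```

In this parameterization, a smaller p or a beta closer to 1 gives a hazard that is lower or rises more slowly, which corresponds to longer uninterrupted sequences of words.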